59 research outputs found

    Problemáticas y tendencias en la arquitectura de metadatos web

    This report sets out the difficulties involved in the effective deployment of the Semantic Web and identifies the agreements and results achieved so far. It describes the information retrieval problems of the current "syntactic" Web and the W3C proposal for solving them. It discusses the recommendations and proposals for conceptual languages, assessing their contribution to the Semantic Web, along with the main disagreements and open questions in the development of the specifications. It also considers other technologies and infrastructures, namely application profiles and schema registries, that are important for creating and using metadata records in different communities, discovering resources, and establishing mappings between formats that allow interoperability among applications.

    The ladder of Information Management towards the future.

    The future of the Information Management discipline is presented as an evolutionary ladder comprising Information Needs, Information Behaviour, Information Audit and, at the top, Information Governance.

    Indización automática de artículos científicos sobre Biblioteconomía y Documentación con SISA, KEA y MAUI

    This article evaluates the automatic indexing systems SISA (Automatic Indexing System), KEA (Keyphrase Extraction Algorithm) and MAUI (Multi-Purpose Automatic Topic Indexing) to find out how they perform relative to human indexing. SISA's algorithm is based on rules about the position of terms in the different structural components of the document, while the KEA and MAUI algorithms are based on machine learning and the statistical features of terms. The evaluation used a collection of 230 scientific articles from the Revista Española de Documentación Científica, published by the Consejo Superior de Investigaciones Científicas (CSIC); 30 of them were used for training and were excluded from the evaluation test set. The articles were written in Spanish and indexed by human indexers using a controlled vocabulary in the InDICES database, which also belongs to the CSIC. The human indexing of these documents serves as the baseline, or gold standard, against which the output of the automatic systems is evaluated by comparing term sets using precision, recall, F-measure and consistency. The results show that SISA performs best, followed by KEA and MAUI.
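The four metrics named in this abstract can be illustrated with a minimal sketch that compares one article's automatically assigned terms with its human-assigned terms. This is not the authors' evaluation code: the function name, the sample terms and the case-insensitive exact matching are hypothetical, and consistency is computed here with Hooper's measure (shared terms divided by all distinct terms), which is an assumption because the abstract does not say which consistency formula was used.

```python
def evaluate_indexing(automatic_terms, human_terms):
    """Compare automatically assigned terms with human (reference) terms.

    Terms are matched exactly after trimming and lowercasing (an assumption).
    """
    auto = {t.strip().lower() for t in automatic_terms}
    gold = {t.strip().lower() for t in human_terms}
    common = auto & gold   # terms both indexers agree on
    union = auto | gold    # all distinct terms assigned by either

    precision = len(common) / len(auto) if auto else 0.0
    recall = len(common) / len(gold) if gold else 0.0
    if precision + recall > 0:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    # Hooper's consistency measure (assumed; not specified in the abstract).
    consistency = len(common) / len(union) if union else 0.0

    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "consistency": consistency,
    }


# Hypothetical terms for a single article.
scores = evaluate_indexing(
    ["indexing", "metadata", "thesauri", "libraries"],
    ["Indexing", "Metadata", "information retrieval"],
)
print(scores)
```

In this example two of the four automatic terms match the three human terms, so precision is 2/4, recall is 2/3 and the assumed consistency is 2/5. A full evaluation would average such per-document scores over the test collection.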

    Elaboración de índices para libros: perspectivas de actuación y formación profesional en España y Brasil

    Book indexes allow information to be located through three characteristic elements: a phrase or expression that summarizes part of the text, the locators (page numbers), and alphabetical arrangement. Specific standards for the production and presentation of indexes have existed for decades, but their use appears to be widespread only among publishers in Anglo-Saxon countries. With the rapid growth in the provision and use of electronic books among individuals and academic institutions, a quality index may be an essential tool for fast and efficient access to information in these new environments. We therefore analyze the curricula, and in particular the indexing subjects, of the Library and Information Science degrees taught in Spain and Brazil, in order to locate content related to the teaching of book index production and publication. Secondly, we intend to survey the teachers of subjects related to this content to learn their opinions on the need to incorporate it into academic curricula.

    Arquitectura de la información: XML y WEB

    Proyecto para un repertorio digital de grabados en libros (siglos XV-XVIII)

    We present a bibliographical and documentary research project that consists of collecting, classifying, analyzing and digitally disseminating the illustrations contained in printed materials (books, pamphlets, loose papers, etc.) from the fifteenth to the eighteenth centuries that are held in the libraries of the Region of Murcia. The general aim is to disseminate information that is difficult to access, namely the engravings, by taking advantage of information technologies. The partial objectives are to locate, identify, digitize, describe and disseminate the engravings. To achieve these objectives, a specific methodology was devised and carried out in three successive phases: digitization, description and database management. The result is a digital repository of book engravings that allows retrieval by author, title, keywords, source, publisher, dates and other elements. This repository will improve access to the information and represents a new experience in applying metadata to the description of images, whose effectiveness we will evaluate in a future phase.

    Gestión de un fichero de autoridades en el marco de la biblioteca digital de historia del pensamiento político “Saavedra Fajardo”

    Design and development of an authority file for the Digital Library “Saavedra Fajardo”, based on the FRANAR model and following the guidelines of the Spanish Cataloguing Rules and the MARC 21 format. The result is a virtual authority file with links to the bibliographic file, which facilitates users’ searches and helps cataloguers produce precise and reliable descriptions.

    Automatic indexing of scientific articles on Library and Information Science with SISA, KEA and MAUI

    This article evaluates the automatic indexing systems SISA (Automatic Indexing System), KEA (Keyphrase Extraction Algorithm) and MAUI (Multi-Purpose Automatic Topic Indexing) to find out how they perform relative to human indexing. The SISA algorithm is based on rules about the position of terms in the different structural components of the document, while the KEA and MAUI algorithms are based on machine learning and the statistical features of terms. The evaluation used a collection of 230 scientific articles from the Revista Española de Documentación Científica, published by the Consejo Superior de Investigaciones Científicas (CSIC); 30 of them were used for training and were excluded from the evaluation test set. The articles were written in Spanish and indexed by human indexers using a controlled vocabulary in the InDICES database, which also belongs to the CSIC. The human indexing of these documents serves as the baseline, or gold standard, against which the output of the automatic systems is evaluated by comparing term sets using precision, recall, F-measure and consistency. The results show that SISA performs best, followed by KEA and MAUI.

    Técnicas y usos en la clasificación automática de imágenes

    An enormous amount of visual information is produced with mobile phones and cameras, and above all through remote sensing: planes, spacecraft and satellites acquire images of the Earth's surface that supply data on meteorology, oceanography, geology, geography, geolocation, security, and so on. These image capture instruments generate more visual information every day than can be processed manually, which is why various techniques and methods are used to extract useful knowledge automatically. This literature review aims to understand the techniques and uses of automatic image classification. To that end, the Scopus and WoS databases were searched for documents on automatic image classification published between 2008 and 2018. The full texts of the resulting records were retrieved and subjected to content analysis to identify the most recurrent techniques and their applications. The review shows that the three most commonly used techniques for automatic image classification are decision trees, neural networks and support vector machines. Automatic classification is applied in a wide variety of settings, where it seeks to automate repetitive processes, complex inspection and surveillance, urban control and development, and recognition and assessment after natural disasters, among others.